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Abstract. We establish an asymptotic formula for the number of fc-difference twin primes 
associated with the Hawkins random sieve, which is a probabilistic model of the Eratosthenes 
sieve. The formula for k — 1 was obtained by M. C. Wunderlich [9]. We here extend this to 
k > 2 and generalize it to all /-tuples of Hawkins primes. 



1. Introduction 

The random sieve was introduced by Hawkins j4|5j as follows. Let Si = {2,3,4,5,...}. 
Put Pi = minS'i. Every element of the set SiyjPi} is then sieved out, independently of the 
others, with probability 1/Pi, and S2 is the set of the surviving elements. In general, at the 
n-th step, define Pn = minS'n. We then use 1/Pn as the probability with which to delete the 
numbers in 5„\{P„}. The set remaining is denoted by Sn+i- The Hawkins sieve is essentially 
a probabilistic analogue of the sieve of Eratosthenes. The sequences {Pi,P2, . . . ,Pn, • • •} of 
Hawkins primes mimic the primes in the sense that their statistical distribution is expected to 
be like that of the primes. The primes themselves correspond to one realization of the process. 

A great deal is known about the Hawkins primes. For instance, the analogues of the prime 
number theorem j5l6l9j . Mertens' theorem j6l9| and the Riemann hypothesis j7|8| are true 
with probability 1. We here concern ourselves with the density of A:-difference Hawkins twin 
primes and its generalization to other /-tuples. 

Instead of a sequence of probability spaces, as considered by Hawkins, Wunderlich [5] 
simplified the process in a single probability space. Let X be the space of all sequences of 
integers greater than 1, i.e., X consists of all finite and infinite sequences. The class of all sets 
of those sequences is Q. For a G X, we denote by an the set of elements of a which are less 
than n, i.e., On = a f]{2, 3, 4, . . . , n — 1} and = a\an- 

Definition 1. An element G is called an elementary set if there exists a sequence 
{ai,a2, ■ ■ ■ ,Ofc} G X and an integer n > at such that E consists of all the sequences a that 
satisfy an = {ai, 02, . . . , a^}. E is denoted by {ai, 02, . . . , a^; n}, and if /c = 0, = {.; n} is 
the set of all sequences whose elements are not less than n. 

The probability function is now defined recursively on the class of elementary sets. 

Definition 2. Define a non-negative real-valued function ^ on the class of elementary sets as 
follows: 



(i) M{-;2}) = i, 



(ii) ^({ai,...,afc,n;n + l}) = ~ — j /x({ai, . . . , a^; n}), 

1=1 ^ 

(iii) /u({ai, . . . ,afc;n + 1}) = " (""^ ~ ' ' • '"^'"'J')- 

For any a G X, the analogue of the A:-difference twin prime counting function is defined as 

^x,x+k{x; a) = #{i < X : j G a and j + /c G a}. 
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Wunderlich [9] showed that I[x,x+iix) ~ 2;/(logx)^ almost surely, which is an analogue of 
Hardy and Littlewood's famous conjecture concerning the distribution of the twin primes [3]. 
The absence of the twin prime constant factor here is due to the drawback of the probabilistic 
setting of the random sieve that it contains little arithmetical information about the primes. 
Though the result is not unexpected, it is not easy to establish, as it is, for example, in 
Cramer's model [1], where every number n is independently deleted with probability 1/logn. 
In Section 2, we follow the lines of Wunderlich [9] and extend the result to /c = 2. 



Theorem 1. Almost surely 



'nx,x+2{x) 



(log x)^ 

Theorem 1 requires rather more work than [9l Theorem 4], but the idea is similar and 
straightforward. Nevertheless, it is clear from the proof for k = 2 that as k increases, the 
calculations will become extremely complicated, and the proof for the general case using 
Wunderlich's method is likely to be extremely messy. In Section 3, we therefore develop a 
different approach and establish the following theorem. 

Theorem 2. Almost surely, for any fixed integer k, as x oo 



X 



(log xy 

As we note in Section 3, our approach extends straightforwardly to /-tuples of Hawkins 
primes to yield 

Theorem 3. Let < ki < k2 < ■ ■ ■ < ki^i and denote by Ilx,x+ki,...,x+ki^iix', ct) the number 
of m < X such that the set {m,m + ki, . . . ,m + ki^i} C a. Then as x ^ oo, almost surely 

X 

Ux,X+ku...,X+h^d^) ~ ^^^g ■ 

An immediate corollary of this theorem is 
Corollary 1. For any positive integers d and I > 2, almost surely, as x ^ oo 

'^X,X+d....,X+(l-l)d{x) ~ 7j 77, 

' ' ' ^ ' (logx)' 

which is reminiscent of a recent theorem of Green and Tao [2] on the existence of arbitrarily 
long arithmetic progressions in the primes, proved using powerful techniques from analytic 
number theory, combinatorics and ergodic theory. 

2. Proof of Theorem 1 
We begin the proof by stating a lemma from [9] . 
Lemma 1. For r, s, t non-negative integers, r < t, define 

k-2 

Mk = i+Y,-- 



Then 



l^Jq = (s + i)M^+ ^■■■^ ^ \Mi 



where c{j,r) = r{r + 1) . . . (r + j — l)/(s + 
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As in [9], we define 



yM= n (1-7)- 



j<m,j(^a 

Then P(m G a) = ymict), and if we let C„ be the set of all sequences containing n, 

= E(y„) = j Vndii. 

Wunderlich then obtained the asymptotic formula for the A;-th moment of which is an 
analogue of Mertens' theorem, 

Some simple calculations give 

P(m G a,m + 2 G a) = {l- — ]{y^{af 3— (l- — ]yM' 



m I \ m + lV m 



Define the auxiliary function 



tix,x+2{x;a) = V iymia) \—{l-—\ym{af] ■ 



m+2&a 



In what follows, we write n(x;a) for lix,v+2(a;; a), and if / : M — t- M, we define the usual 
difference operator A applied to / by A/(m) := /(m + 1) — f{m). We have 

{ym+i{oi) - - ^)ym+i(a)2)"^ ifm+lGa, m + 3Ga 



^ ' ^ '0 otherwise. ^ ' 



Hence 



Thus 



E(n(m + 1)) - E(n(m)) = ( 1 ^— )E(y^+i). 

m + 1 



We now wish to evaluate the variance of Ii{n). It is easy to see from ([T|) that 

AE(fl2(m)) = 2(1 ^-—] E(y„+iII(m)) + 0(1). (3) 

It is necessary to find another recursion for ym4.i(a)*II(m; a). We have 

ym+2(a)*n("i + 1;q) = 

(1 - ^yyL+i{^{'m;a) + (y„^+l(a) - - ^)y™+i(a)2)"^) 

if m + 1 G a and m + 3 G a 



(1 — Tihri) Vm+i^i''^'^ o^) if m + 1 G a and m + 3 ^ 
yln+i^{f^'-: Oi) otherwise. 



a 
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Since 

' P(m+lGa,m + 3Ga) = (l-^)(y„+i(a)2-^(l-^)y„+i(a)3) 
P(m + lea,m + 3^a) = ym+i{a) 

^ F{m + 1 ^ a) = 1 - Vm+iia), 
we easily obtain 

Taking i = 3, summing from 1 to n — 1, and using K{yf.^^I[{k)) = 0{k'E{yf._^^)), we have 

Letting i = 2 in (g]) , 

E(y^+2n(m + 1)) - E(y^+ifl(m)) = 

Summing from 1 to n — 1 we obtain 

"l n ( n \ 

X] M3" + ^ ( 5Z ) = ]^ + ( ]^ ) • 

We are now ready to find E(ym4_in(r?T,)). Letting i = 1 in (g]), 

E(y„+2n(m + 1)) - E(y„+ifl(m)) = f 1 - nvl+i) " ^-E(y^+in(m)). 

\ m+1 J m + 1 



So 



n . ^ ^ 2 n ^ 

nVn+Mn)) = El-- ^(2^-) - E -lE(2/in(m - 1)) 

m=2 ^ ^ m=2 



E A/T2 E A/f 3 + ^ ( E A//'4 ) A#2 + A/^S + ( 



m=2 ™ m=2 ™ ^ m=2 ™ ^ " " \^"n/ 

Substituting this into ([HD we have 

_ / n/^ of nf 

From ([2]) and ([S]) we deduce that 

r IE(n(n)) = ^ + ^ + 0(^) 
1 Var(n(n)) = 0(^). 
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Theorem 2 in [9] then imphes that almost surely 



n(n) 



n 



log n 



Now we define 

Then 

^x,x+2{n;a) 



E 



ifm€a, m + 2Ga 
otherwise. 



am{a)bmia), 

m<n 



m + \\ m 



where 



and 



Om(a) 



rm{a) 



ym{a) - ^(1 - ^)ym(a)2 ' 



&m(a) = ym{oi) 



1 



m + 1 



1 - — )2/m(a)^- 



Let Ao(a) = and j4m(a) = XlJLi Oj(Q!). Using Abel's summation we have 

nx,x+2(n;a) = ^ am(a)6m(a) = ^„(a)6„(a) - ^ Am{a){hm+i{a) -bmia)). 



m<n 



Since An[a) = n(n; o), Afibfi ~ (tt,/ log 7T.) (1/ log 7T,) ~ fi/(logfi)^ almost surely. Hence the 
result follows if we can show that 



'm+l "ml 



n 



(log n 



|2 / • 



Firstly, 



1 



< 



E 



1 



6 



y — 



yiM — hyiM ^ yii^^) 



which is 0(m/ log m) from [9] (cf. Theorem 4). Secondly, 

l^m+l ~ ^-m| = 



ym+i(a) 



1 



1 



1 



m + 2 V m + l 



ym+i(Q)^ - ym(a) + 



1 



m + l 



1 

m 



Since 



we obtain 



(1 - ^)ym(a) = ^ym(a) if m G a 

otherwise. 



m-2 „. /„a2i otherwise. 



I m{m+l){m+2 
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So 

\bm-,i{a) - brn{a)\ < I _1_,^(«)2 otherwise. 

Hence 

^ Am{a)\bm+i{a) - bm{a)\ = ^ Am{a)\bm+iia) - bmia)\ 



m<n ni<n 



+ XI ^raia)\bm+iia) - bmia)\ 



~ ^ m ^ m(m + l) 

m<n m<n ^ ' 



Thus 



V ^^(a)|6^+i(a) -6^(a)| = o( V ^ ) +Q( V . 3 ) 

V (logm)^ / V ^-^ m(logm)'^/ 

m<n ^m<n ^ ' ' ^ m<n ^ ° ' ' 



' m<n 



which is easily seen to be 0(n/(logn)^) from the analogue of the prime number theorem for 
the Hawkins random sieve. The result follows. 



3. Proof of Theorems 2 and 3 

In this section we denote m[ii, i2, . . . , i;] € a, where i\ < 12 < ■ ■ ■ < ii-, \>o mean m + 
{ii, 12, . . . , i;} C a, and m + /i ^ a for all /i € [ii, • • • , ^/}- 

Lemma 2. Given a non-negative integer I and = io < ii < i2 < ■ ■ ■ < ii < = A;, define 

m<n 
m[io,n,...,j(+i]6Q; 

T/ien ~ n/(logn)^"'"^ almost surely. 

Proof. We simply write T'(ra) for Tjjp jj^^](ra; a). Let A„ be the event n € a and i?„ be the 
complement of A^, i.e. Bn = A^. We then have 

P„ := P((m+l)[io,ii,...,im] ea) 

= IP(^m+l+ij+i-Bm+ij+i ■ ■ • • • • -Bm+2+io^m+l+io)- 

By the chain rule 

Pm = ^{^m+l+io) 

X]P(-Bm+2+iol^m+l+io) • • • P(-Sm+ii |-Bm+n-l • • • -Sm+2+io^m+l+jo) 
xPl^m+l+ii l-Bm+ii • • • Pm+2+io-^m+l+io) 
X . . . 

xlPl^m+l+ij+i l-Sm+ij+i • • • -6m+2+ij^m+l+ij • • • -Bm+2+io^m+l+io) 
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Um+l 



\ \ ii-l 



m + l) 



Um+l 



X ... 



, 1 , ■ ) (l , ■ ) ••• (l ^Wm+l, 

or, in short, 
Since 



i=0 ^ •'^ j=0 ^ /i=0 ^ lift/ / 



j=0 ^ /i=0 



= \-(k-l- + o(^) + 



we have 



Prom the definition of T{m), 



T{m + 1) - r(m) 

Hence 



1 if (m + l)[io,n, . . . G a 

otherwise. 



AE(r(m)) = E(£|i) -(k-l- l)E(£|i) + o(^-^i^ j + 0(E(8,«,)) 



1 ' 



Summing from 1 to n — 1 yields 



w,/™/ NX n (l + 2)n (k — I — l)n ^( n 



The next step is to estimate the variance of T{n). As in the case of the previous theorem, 
we need to estabhsh a recursion for y\^_^{r{m). For this we have 

yin+iT{m) if (m + 1) ^ a 

yin+2T{m + !) = { (1 - ^Jy'^+i{T{m) + 1) if (m + l)[io, n, • • • , H+i] G « 
(1 - ^yym+iT{m) otherwise. 
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Since 



\m+lia) = l- y„,+i 

'((m + l)[io,ii,...,i/+i] G a) = Pm, 



we deduce that 



= E(y;,+iT(m)(l - y^+i)) + (^1 - E(y;,+i(r(m) + 1)P^) 

E{yl^,Tim){y^+i-P^)), 



+ 1 



or, equivalently, 

AE{yl+,nm)) = (l - E{yl,^M " (l " (l " ;;^) ) IE(y^„+iT(m)). (7) 



Letting 1 = 1 + 4 and recalling that 

Pm = y^^^i -{k-l- l)y^+l, + 0{yl^,„ 



we obtain 



m+l 

So 

n 



ny^\\nn)) = o('£-^^=o 

^ m=2 ^ 

Letting i = / + 3 in ([7]) and summing from 1 to n — 1 we have 



" +0 " 



m=2 ™ ^ m=2 

Finally, substituting z = / + 2 in ([7]) , 



+ 



Thus 



m=2 '"■ m=2 ^ m=2 ^ 



n (2/ + 4)n (k + l)n ^( n 



M2'+4 M2'+5 M2'+5 Vm2'+6 

= ^ (fc-2/-3)n / n \ 

Now, back to the variance of T{n), 

T^(m + l) = i ^'M + 2TM + 1 if (m + l)[io,ii,...,im] € « 
1 T^(m) otherwise. 
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Therefore 

E{T\m + l)) = E(r2(m)(l-P„))+E((r2(m) + 2r(m) + l)P^) 
= E{T\m)) + 2E(r(m)P„) + E(P„). 

And hence 

AE{T\m)) = 2Kiy'+l,T{m))-2ik-l-lMy'+l,T{m)) 
+0{E{y^^^,T{m)))+0{K{yl^l,)). 

From ([S]) we obtain 

^„,^o, f m (k — 21 — 3)m\ 2(k — I — l)m ^( m 



2m 2(2A;-3/-4)m m 



So 



n— 1 „ 71—1 „ / n— 1 

E(T2(n)) = V4?^-(2fe-3/-4) V4?^ + 0( V 



V? ^{l + 2)n'^\ (2A;-3/ -4)n2 



^''-'^-'^"%or^Y (9) 



Combining ([6]) with ([9]) we have 

\ Var(r(n)) = 0(^). 

Theorem 2 in [9] again yields T(n) ~ n/(log n)'"^'^ almost surely as asserted. □ 

The proof of Theorem 2 now follows immediately from Lemma 2 by noting that almost 
surely 

fe-i 

^X,X+k{x) = ^ ^ %ii,i2,...A,fc](^) 

i=0 0<ii<i2<... 

fe-1 

= ^[0,fc] (a^) + X] X] ^[0,n,i2,...A,fc](3;) 

Z=l 0<ii<i2<...<ii<fe 

= (l + o(l))7T7:^ + of 



(loga;)^ \(logx) 
Similarly for Theorem 3, almost surely 



nx,x+k^,...,x+ki_^{x) = Tio^ki,k2,-M-i] (^) + O ( (log 

(1 + 0(1))-^ + of ^ 

(logx)' \(logx)'+^ 
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